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ABSTRACT 

We used a genetic screen based on tRNA-mediated 
suppression (TIVIS) in a Schizosaccharomyces 
pombe La protein (Sla1p) mutant. Suppressor 
pre-tRNA®^'UCA-C47:6U with a debilitating substitu- 
tion in its variable arm fails to produce tRNA in a 
sla1-rrm mutant deficient for RNA chaperone-like 
activity. The parent strain and spontaneous mutant 
were analyzed using Solexa sequencing. One syn- 
onymous single-nucleotide polymorphism (SNP), 
unrelated to the phenotype, was identified. Further 
sequence analyses found a duplication of the 
tRNA®®'^UCA-C47:6U gene, which was shown to 
cause the phenotype. Ninety percent of 28 isolated 
mutants contain duplicated tRNA®^'^UCA-C47:6U 
genes. The tRNA gene duplication led to a dispro- 
portionately large increase in tRNA®^'^UCA-C47:6U 
levels in sla1-rrm but not sla1-null cells, consist- 
ent with non-specific low-affinity interactions 
contributing to the RNA chaperone-like activity of 
La, similar to other RNA chaperones. Our analysis 
also identified 24 SNPs between ours and S. pombe 
972h- strain yFS101 that was recently sequenced 
using Solexa. By including mitochondrial (mt) DNA 
in our analysis, overall coverage increased from 



52% to 96%. mtDNA from our strain and yFS101 
shared 14 mtSNPs relative to a 'reference' mtDNA, 
providing the first identification of these S. pombe 
mtDNA discrepancies. Thus, strain-specific and 
spontaneous phenotypic mutations can be 
mapped in S. pombe by Solexa sequencing. 

INTRODUCTION 

tRNA gene number and arrangement are of interest to 
genome biologists (1-7) because tRNA abundance is 
matched to codon usage (3) and tRNA genes affect 
nuclear and genome organization [(8,9), reviewed in 
ref. 10, and see ref. 11]. Biased codon usage in function- 
ally related mRNAs suggests that the relative levels of 
tRNA isoacceptors may reflect a means of genetic 
control (12-15). Most eukaryotes contain a variable 
number of tRNA genes, from ~200 to several thousand 
(4). Yet, in humans as in species from yeast to dog, about 
12 codons have no tRNA with direct Watson:Crick 
basepairing (Supplementary Table and ref. 4), and must 
rely on wobble decoding. Further, the ratio of different 
isoacceptor tRNA genes can vary significantly between 
species (16) (Supplementary Table SI). Thus, eukaryotic 
evolution has been accompanied by highly variable prolif- 
eration of certain tRNA genes while many others are 
absent or extremely underrepresented. 
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Promoters for eukaryotic tRNA genes are all quite 
similar and provide similar transcription output. As 
such, differences in tRNA levels usually occur by 
varying tRNA gene copy number (17) which appears to 
coevolve with codon usage (3,5). Sequence identity 
suggests that multicopy tRNA genes arise by duplication 
and are maintained by gene conversion [(4), reviewed in 
refs 10,18 and 19]. Chromatin at eukaryotic tRNA loci 
contains RNA polymerase III (pol 111) and other factors 
including the RNA-binding, molecular chaperone protein, 
La (20,21). Transcription termination by pol III generates 
UUU-3'OH ends on its nascent RNA products (22,23) 
which is a binding site for La (24-26) and thus directs 
early RNA processing events (27). 

Maturation of a nascent pre-tRNA to a mature func- 
tional molecule requires many enzymatic and 
carrier-mediated processes (28). Inherent variabihty in 
the sequence and structure of different tRNAs suggests 
that they will differ in their activity as substrate for a 
particular enzyme, binding protein, or process (27). 
Indeed, significant variabihty in the order of processing 
and modification steps exists for different tRNAs (27). 
In the collective 'pathway' of tRNA maturation some 
events differ in the extent to which they are 
concentration-driven, kinetics-limited, spatially ordered 
and amenable to alternate mechanisms (27). Processes 
that degrade pre-tRNAs, such as nuclear surveillance, 
compete with some of the productive steps (27). 
Accordingly, pre-tRNAs that are not efficiently engaged 
by productive processing become subjected to surveillance 
systems that actively degrade hypomodified, improperly 
processed or structurally impaired pre-tRNAs (28-31). 
As a chaperone for pol III transcripts. La protein can 
offset this type of decay (27,32). Thus, the course for a 
pre-tRNA can vary depending on if it is engaged by La 
and/or other factors (27,28). Moreover, recent findings 
that unprocessed pre-tRNAs can activate stress signals 
(33,34) that can impact the small interfering RNA 
(siRNA) (35) or other pathways (31,36^0) suggest that 
our understanding of tRNA metabohsm is incomplete. 

While La serves to facihtate pre-tRNA processing, it is 
nonessential in yeasts (24). La contains two conserved 
RNA binding motifs, the La motif (LAM) and an 
adjacent RRMl that act differently during tRNA matur- 
ation (41). Two types of chaperone activity have been 
attributed to La proteins, molecular chaperone activity 
and RNA chaperone activity (27). Its principal 
RNA-binding activity, UUU-3'-0H binding, is mediated 
primarily by the LAM to protect against 3' exonucleases 
and promote 3' end processing and RNP maturation. 
Mutations in human La (hLa) or Schizosaccharomyces 
pombe La, Slalp, that disrupt RNA 3' end binding or 
intracellular trafficking cause tRNA processing defects 
consistent with dysfunctional molecular chaperone 
activity (42,43). 

The RNA chaperone-hke activity noted for human and 
yeast La proteins (44-47) is an enigmatic aspect of La 
function that is discernible from UUU-3'OH binding. La 
promotes the maturation of pre-tRNAs with structural 
impairments that otherwise subject them to misfolding 
and degradation (32,46). The typical p-sheet surface and 



associated loop-3 of La RRMl constitute an 
RNA-binding site that confers RNA chaperone-like 
activity (32,47). Two classes of mutations in La RRMl 
have been distinguished. Mutation of basic residues in 
RRMl loop-3 do not impair UUU-3'OH binding but sig- 
nificantly decrease binding to other parts of the 
pre-tRNA, decrease RNA chaperone activity and are det- 
rimental to tRNA maturation (47). Mutation of the two 
invariant aromatic residues on the RRMl P-sheet surface 
debihtates hLa and Slalp for maturation of structurally 
impaired pre-tRNAs in vivo, but with too subtle a defect in 
RNA binding for detection in vitro (32). Nonetheless, 
these p-sheet surface aromatic mutations inactivate hLa 
and Slalp for maturation of structurally impaired 
pre-tRNA^" even in cells that lack Rrp6p (32), the 3' exo- 
nuclease component of nuclear surveillance that degrades 
one class of structurally impaired pre-tRNAs (30,32,48). 
This observation suggested a novel, rr/>(5-independent deg- 
radation pathway for structurally impaired pre-tRNA " 
in the slal-rrml mutants (32). 

To try to better understand this, we developed a genetic 
screen in 5. pombe that relies on accumulation of a func- 
tional suppressor tRNA^'"UCA-C47:6U despite a 
debihtating substitution of a G:C to a G:U basepair in 
its variable arm (Figure lA). In cells carrying wild-type 
Slalp, the pre-tRNA^"UCA-C47:6U from this allele is 
matured and functionally suppresses a nonsense codon 
in ade6-704, alleviating accumulation of red pigment. 
However, this allele does not produce tRNA in the 
slal-rrml mutant carrying the point mutations in the 
RRMl P-sheet surface at levels sufficient to mediate 
TMS (32). We isolated spontaneous revertants that 
restore tRNA^*^' -mediated suppression in slal-rrm cells 
and mapped the phenotypic mutation by whole genome 
sequencing. We estabhsh that this approach is applicable 
to S. pombe and report that the phenotypic mutation is 
duphcation of the suppressor tRNA^" gene. We show that 
a surprising disproportionate increase in suppressor- 
tRNA levels that occurs with tRNA gene duplication 
requires slal-rrm, provoking a model in which activation 
by La of tRNA maturation is dependent on the concen- 
tration of the pre-tRNA. 

In the process of genome analysis, we also uncovered 
24 strain-specific single-nucleotide polymorphism (SNPs) 
in the genomes of our laboratory strain and the reference 
strain 972h- (yFSlOl) which were separated ~30 years ago 
and passed through multiple laboratories. 

MATERIALS AND METHODS 

Strains 

Strains are listed in Table 3. slal-Y 157 A or slal-Y 157 A/ 
F201A RRMl mutations were introduced into ySH18 
which harbors a copy of tRNA^'='UCA-C47:6U and 
associated pJK148 DNA at the leul-32 locus as described 
(49) to generate yMWB2-4 (test strain) and yMWB3-15 
(parent), respectively. yMWB3-15 was plated on media 
containing limiting (lOmg/1) adenine, and spontaneously 
arising white colonies (e.g. yDA317) were isolated after 
3 days of growth and characterized as described. 
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LAM RRM1 

Figure 1. The Slalp and suppressor-tRNA^"UCA-C47:6U compo- 
nents of the yMWB3-15 Strain. (A) Precursor (left) and mature 
(right) suppressor tRNA^"''UCA-C47:6U harbors a C47:6U mutation 
on the variable arm indicated by a blue circle and arrow. The three 
anticodon residues are boxed. The intron in the precursor is 
demarcated by the solid lines at the intron junctions. (B) Structure of 
the LAM (gold) and RRMl (green) of La protein bound to a 10-mer 
RNA ending in UUU-3'OH (orange) modified from PDB 2VON (71). 
The highly conserved P-sheet surface residues that extend from the 
RRMl binding surface of Slalp, Y157 and F201 mutated for this 
study, are highlighted in blue. Loop-3 which comprises part of the 
canonical RNA-binding site of RRMl (47) is also indicated. 



Construction of a customized yMWB3-15 reference 
sequence 

The 5. pombe genome used in this study originated froin 
the sequence obtained at Sanger Institute, dated June 2008 
(50). Modifications were introduced to better reflect the 
strains used in this study following a first pass of read 
alignment. SNP and insertion/deletion modifications 
noted by the Broad Institute (51) were incorporated 
after identifying identical mutations in the parent strain. 
pJK148 vector sequence including the suppressor tRNA 
insertion (49) was incorporated at the leul-32 locus as 
described (52). 

The four poly-N gaps in the Sanger reference were 
probed as follows. Gap-flanking sequences were used in 
a seeded search of unaligned reads. For the gap at Chr 2 
position 80202 this led to isolation of continuous sequence 
tracts on either side. A BLAST search identified the 



previously resolved ~18-kb gap-spanning sequence (53) 
which was inserted to replace the poly-N gap, and pre- 
dicted genes in this ~18kb were incorporated into the 
sequence annotations. Introduction of this sequence 
provided continuous coverage at near average depth 
across the region. As a result, some Chr 2 gene positions 
in our reference are offset by ~18 000 relative to the 
Sanger positions (Tables 1 and 4). Similar attempts to 
resolve the remaining three contig gaps (Chr 1: 29662; 
Chr 2: 1616568; and Chr 2: 1083349) were unsuccessful 
with this read data set. 

For some applications, we incorporated the complete 
mitochondrial DNA sequence (54,55) Schizosac- 
charomyces pomhe ad7-50h- (Project ID: 20755). 

Illumina Solexa sequencing and identification of mutations 

Sequencing of samples was according to standard 
methods. Unpaired 36-bp reads were collected and 
pooled under designation WT (yMWB3-15), MUT 
(yDA317) or TEST (yMWB2-4) for each. Our updated 
5". pombe reference genome was used in alignments per- 
formed using MAQ (Mapping and Assembly with 
Quahties) software (56) without any trimming. SNPs 
were called from sequence alignments. Deletions and in- 
sertions were identified and called by ahgnment of 
flanking sequence seeds to regions of zero coverage 
using the total read database, followed by verification by 
polymerase chain reaction (PCR) amplification of 
genomic DNA and conventional sequencing. 

Software and sequence analysis tools 

Reads were obtained and adjusted to Sanger FASTQ 
format and pooled for each strain dataset (57). MAQ 
software was utilized in ahgnment of reads and calling 
of SNPs (56). Ahgnments were manipulated with 
SAMtools (58). Programs used in our analysis were 
written primarily in Perl with some use of BioPerl 
modules (59) and are described and available for free 
download as the MATCH-G toolset at http://science 
.nichd.nih.gov/confluence/display/smcb/MATCH- 
G+Program or through the Maraia laboratory website 
http://science.mchd.nih.gov/confluence/display/smcb/ 
Home. Briefly, these programs performed the following 
tasks: assignment of SNPs to the reference genome and 
determination of resulting mutations with corresponding 
confidence scores; determination of regions corresponding 
to zero read coverage; seeded search and alignment of zero 
coverage flanking sequence regions for determination of 
small deletions and insertions; comparison of annotated 
feature average mapped read depth between data sets; and 
a sliding averaging window comparison of mapped read 
depth between data sets. 

Verification of mutations 

Mutations were verified from DNA prepared from 
acid-washed bead-lysed 5. pombe cells. Following basic 
phenol-chloroform extraction, lysate was treated with 
DNase-free RNase A and DNA was ethanol precipitated 
and quantitated by nanodrop. PCR amplification of 
regions of interest was performed and purified using a 
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QIAGEN QIAquick PGR cleanup kit. PGR products 
were sequenced by AGGT, Inc. (Glenview, 111) to verify 
mutations identified by lUumina sequencing. 

Determination of duplications 

Identification of regions of duplication was performed 
using mapped coverage depth by two approaches. In the 
first approach, for each annotated feature 'context' of the 
genome, the number of mapped reads at each position was 
summed and averaged over the full length of the 
annotated region. These averages were compared for 
yMWB3-15 and yDA317 to obtain a ratio corrected by 
a factor of 1.12 to reflect the difference in average mapped 
read depth for these strains. The ratios were compared at 
each feature to estabhsh statistical distribution for all 
annotated features, filtering out only those <50nt long, 
and copy number exceeding 1000 reads, to limit noise. 

In the second one, context-independent approach, we 
used a sliding averaging window of 400 bases and 
average mapped read depth was compared for 
yMWB3-15 and yDA317. Positions where the average 
depth difference exceeded 1.8-fold were identified as indi- 
cative of a duplication or loss of copy event and combined 
into continuous units. 

Intentional introduction of additional suppressor 
tRNA^"UCA-C47:6U gene 

The ~400-bp suppressor tRNA^'='UGA-G47:6U Pstl-SacI 
gene fragment (49) and ~ 150 bp of flanking DNA on 
either side was PGR amplified from the pJK148 vector 
used to create yWMB3-15, and hgated into pFA6A at 
the Sail and BamHI sites. pFA6A with and without the 
suppressor tRNA insert were linearized with Ndel and 
transformed into yMWB3-15 and yMWBl-1 in parallel. 
Transformed ceUs were plated onto YES media then 
replica plated onto YES + G418 to screen for integrants. 

Three G418-resistant transformed colonies from the 
empty pFA6A and five from the suppressor tRNA- 
containing pFA6A were isolated and streak purified. 
These were then spotted onto media with limiting 
adenine (lOmg/1) alongside yMWB3-15 and yDA317 for 
comparison to examine suppression phenotype. 

Northern blot and genomic DNA PGR verification of 
duplication 

Total RNA was prepared from cells and northern blotted 
as described (49) using 10% TBE-urea gels and iBlot 
transfer apparatus (InVitrogen). The RNA was 
transferred to a Genescreen-Plus membrane which was 
UV-cross-hnked and vacuum baked and hybridized with 
a ^¥-5'-GAGAGAGGGGATTAGATTTGAAG DNA 
complementary to spliced mature suppressor 
tRNA^'"'UGA-G47:6U in the presence of an equal 
amount of unlabelled oligo GGGAGAGGGGATTAGA 
TTTGAAG to block hybridization of the ^¥-5'-probe 
with the highly related endogenous tRNA^'^'UGA, a 
method known as homolog exclusion probing (60). After 
washing, the membrane was sealed, exposed and 
quantitated by Phosphorlmager. The membrane was 



stripped, re-hybridized with a U5 snRNA probe to stand- 
ardize quantitation. 

Genomic copy number of the suppressor tRNA gene 
was verified by semi-quantitative duplex PGR utilizing a 
single copy region of the genome (slal gene) for compari- 
son. Purified genomic DNA was incubated with two 
primer pairs (slalF: GGGAATATTGTTAGGATTTAA 
GGATT, slalR: AAGATTGTGTTGATGAGTAGG 
AAA, SUPF: GGGGGGGTGTTGGGTATTA, SUPR: 
GGGTGGTATGTTGTGTGG AATT) and amplified as 
follows (95°G 5min; 25 cycles at 95°G 45 s, 55°G 45 s, 
72°G 90s; 72°G 3min, 4°G). Twenty-five cycles had been 
demonstrated to be within the linear range of amplifica- 
tion for both primer sets (data not shown). Bands were 
stained with ethidium bromide, visualized by UV and 
quantitated using Vision WorksLS software. A ~700-bp 
band density corresponding to the suppressor tRNA was 
divided by the ~1000-bp band density corresponding to 
slal. The ratios for each of four independent PGR data 
sets were normalized using yMWB3-15 ratio as represent- 
ing a 1:1 copy number. 

Tandem duplication was detected by a PGR assay 
strategy (61) using four primers: (i) ATAAGGAAGGGT 
TGGGAGGA, (ii) TATGGGGTGAGAATTGGAGA, 
(in) GGAATTGGGGGTATAGTGAGTG and (iv) TGG 
ATGAAGATTGTAAATGGTAGG. The positions of 
primers 1-4 are indicated in Figure 4A. PGR was per- 
formed on purified genomic DNA in 25niM Tricine pH 
8.7, 85 mM KOAc, 8% glycerol, 1% DMSO and 1.2 mM 
Mg(OAc)2 using a mixture of polymerases 2U Tth 
(Promega) and O.IU Vent (NEB) in 50 ul. Gychng was 
95°G 5min; 30 cycles at 95°G 45 s, 55°G 45 s, 68°G 
7min; 72°G 3min, 4°G. Five microliters of each reaction 
was resolved on 1% agarose gel. 



RESULTS 

Isolation of spontaneous phenotypic mutants 

Our laboratory strains yAS95 and yAS99 containing 
ade6-704, were derived in 1996 after crossing spll90 (62) 
(a.k.a., yAS50) with spDJVl (63) derived from spB67, a 
diploid with the ade6-M210/ade6-216 alleles (64). The 
strain generated for the present study, yMWB3-15 
(parent) harbors ade6-704, which contains a UGA stop 
codon (65) that leads to accumulation of red pigment 
when grown in media with limiting adenine (66). 
Suppressor tRNA^^UGA can suppress this nonsense 
stop codon and lead to production of functional Ade6p 
enzyme, which suppresses red pigment. yMWB3-15 
contains the suppressor tRNA^'"'UGA-G47:6U allele 
which carries a nucleotide substitution in the variable 
arm, and the slal-rrm mutant allele slal-Y157A,F201A 
(Figure IB). The tRNA^'^'' substitution changes a G:G 
basepair to a G:U basepair in the tRNA^^'UGA variable 
arm. This substitution prevents tRNA^'^'UGA- 
G47:6U-mediated suppression in slal-A as well as 
slal-Y157,F201A cefls (32). yMWB3-15 harbors 
ade6-704, tRNA^'=''UGA-G47:6U and chromosomal 
slal-Y157A,F201A, and expresses Slalp-Y157A,F201A 
at wild-type levels (data not shown), but accumulates 
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Table 1. Single-nucleotide polymorphisms obtained by Solexa sequencing of parent, mutant and test strains 



Classification 


Pn^i ti nn 




Ref 


Test 


Peircnt 


Mut 


LOL' LIS 




1 est stram 


—Known 


i 1 Jo 1 J Jt) 




T 




T 


T 


/ 6 


Yes 


Test strain 


—known 




T 1 SI n 






Q 


Q 


n ip 


Yes 


Test strain 


—known 


T 1 "^Rl 71 Q 

1 1 JO I 1 17 


T 1 "^Sl 71 0 
1 1 JO illy 


VJ 


A 


u 


ri 




1 es 


Test strain 


-known 


T 1 'X'klC\Al 
i iJoZUM-Z 


1 IjoZUHZ 


T 
1 


T 


p 

l„ 






1 es 


Test strain 


-known 


i i JOZ 1 J 7 


T 1 '^R9] SO 
1 1 joZ i J" 






Q 


Q 




Yes 


Test strain 


-known 


T A'^Si.lll^ 

1 V 30Z.Z./.D 


1 IjoZZZJ 


^ 


T 
i 


P 


c 




1 es 


Test strain 


-known 


I 1382753 


I 1382753 


A 


A 


G 


G 


slal 


Yes 


Test strain 


-known 


I 1382837 


I 1382837 


A 


A 


G 


G 


slal 


Yes 


Test strain 


-known 


I 1382838 


I 1382838 


A 


A 


C 


C 


slal 


Yes 


Test strain 


-known 


I 1383602 


I 1383602 


A 


G 


A 


A 




Yes 


Test strain 


-known 


I 1384146 


I 1384146 


T 


C 


T 


T 


tim40 


Yes 


Test strain 


-unknown 


I 877349 


I 877349 


C 


C 


T 


T 


nop58 


Yes 


Test strain 


-unknown 


I 2277237 


I 2277235 


C 


C 


A 


A 




Yes 


Test strain 


-unknown 


I 1177692 


I 1177692 


A 


G 


A 


A 


dakl 


Yes 


Parent # mutant 


I 933736 


I 933736 


C 


C 


C 


T 


erg3 


Yes 


Base deletion 


II 621061 


II 603280 


T 








miscRNA 


Yes 



Position refers to our parent strain with roman numeral reflecting chromosome. "SangerPos' refers to the position as found in the Sanger Institute 
reference genome. The corresponding gene locus is noted, if any. Nucleotide identities at a given position are shown for four strains, reference 
yFSlOl (Ref), Test, Parent (WT, yMWB3-15) and mutant (Mut, yDA317). Multiple known mutations in our test strain at the nhp6-slal-tim40 locus 
region on chromosome I, spanning positions 1381336-1384146 served as a control. Each mutation was verified by conventional PCR-mediated 
sequencing. 



red pigment due to lack of tRNA-mediated suppression 
(TMS). 

yMWB3-15 was plated on limiting adenine media in 
order to isolate spontaneously arising white colonies. 
Multiple isolated colonies were purified and subjected to 
conventional sequencing of the tRNA^'='"-C47:6U, 
slal-Y157A,F201A and ade6-704 loci to screen for 
mutants that had not reverted these alleles to their func- 
tional counterparts (no reversions were found, data not 
shown). The mutant designated yDA317 was chosen for 
Solexa sequencing because it exhibited higher levels of 
mature suppressor tRNA^'"'UCA-C47:6U, than the 
parent strain (data not shown, see below). 



Whole genome Solexa sequencing reveals known 
SNP mutations 

yMWB2-4 is a strain generated during the development of 
yMWB3-15 that was known to harbor mutations in and 
around the slal"^ locus that differ from the parent strain. 
yMWB2-4 was subjected to whole genome sequencing in 
parallel with parent (yMWB3-15) and mutant (yDA317) 
to serve as a test and to help train our analytical methods. 
SNPs relative to the parent were determined blindly by 
MAQ alignment software (56) utilizing S. pomhe genome 
972h- (GenBank) with modifications (1,51) as the ahgn- 
ment reference (Table 1). This led to identification of all of 
the yMWB2-4 mutations at the nhp6-slal-tim40 locus 
(Table 1, known), as well as three additional differences 
(unknown), aU of which were verified by conventional 
sequencing. 

We then compared the parent (yMWB3- 1 5) and spon- 
taneous mutant yDA317 and identified only one SNP, 
which represents a synonymous mutation to valine 74 of 
the erg3'^ gene open reading frame (Table 1). Analysis of 
this SNP in erg3'^ predicted no significant change in valine 



codon usage or local RNA structure. Furthermore, no 
reverse strand RNA or open reading frame (ORF) was 
annotated nor predicted. Given the number of adventi- 
tious synonymous SNPs discovered in our test strain 
(and in yFSlOl, below), it seemed unhkely that the 
erg3'^ synonymous SNP caused the TMS phenotype in 
yDA317. We therefore sought to examine the whole 
genome sequences of our parent and mutant strains for 
mutations other than SNPs by further analysis of the 
Solexa data. 



Identification of a tRNA^"UCA-C47:6U gene 
duplication in yDA317 

Insertions and deletions were sought by identifying 
sequence reads that flanked regions of zero mapped 
coverage in the ref. genome, followed by verification by 
PGR sequencing. This approach identified a single base 
deletion in an annotated miscRNA, at chromosome 2, 
Sanger position T603280 (Table 1). However, this 
deletion relative to the reference was shared by the 
parent and mutant strains (Table 1 ) and is not responsible 
for the mutant phenotype. 

The next approach was to determine if a change in copy 
number of a gene or region had occurred, and, for this, 
two strategies were employed. At the positions of known 
and predicted genes, the mapped read depth was averaged 
across the entire length of the annotated feature, as an 
annotation context-dependent approach. These averages 
led to ratios of mapped read depth for the parent and 
mutant at each known transcription/annotation unit. 
These ratios were corrected shghtly to reflect the difference 
in average read depth for the entire genomes of the strains, 
40 for the parent and 45 for the mutant. When the ratios 
were plotted in a simple statistical distribution, the copy 
number varied along a standard bell curve at a mean of a 
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WT/Mutant Mapped Read Depth 




Figure 2. A simple statistical distribution of annotated gene copy number as estimated from mapped read depth. A Z-score of zero represents a 1:1 
ratio of wild type, (WT, yMWB3-15) and mutant (Mut, yDA317) read depth at an annotated genome feature. A negative Z-score reflects greater 
read depth or copy number in the mutant strain, and vice versa. The downward arrow indicates the suppressor tRNA^'''UCA-C47:6U gene, 
representing a 2-fold greater copy number in the mutant strain. 



Table 2. Annotation context-insensitive computational screen for change in copy number 



Chr 


Position 


Length 


MUT/WT 


WT Cover 


Mut Cover 


Locus 


1 


1552075 


43 


1.9 


40 


74 


USP (Universal Stress) family 


2 


1974364 


5809 


2.1 


54 


110 


Suppressor-tRNA + flanking DNA 


3 


565558 


108 


1.9 


30 


56 


Intergenic mkcRNAISPCC1672.03c 


3 


997392 


23 


1.9 


43 


83 


Intergenic {vcxljinacB) 



An averaging window of 400 nt was scanned across the three chromosomes (Chr) comparing average mapped read depth/coverage between parent 
(WT) and mutant (MUT) strains. 

Base positions exceeding a 1.8-fold change in coverage were flagged and combined into continuous units. 

The ratio of mutant/wild-type coverage is indicated (MUT/WT); the average coverage is indicated for WT and Mut. 



1-to-l ratio (Figure 2). One significant outlier, however, 
was in the number of reads mapping to the suppressor 
tRNA^"UCA-C47:6U sequence, suggesting gene duphca- 
tion. The position on the bell curve of this copy number 
variation was more than 4 SD away from the median, 
toward increased copy number in the mutant strain 
(Figure 2). 

The potential for gene duplication was then assessed by 
a second strategy that also yielded size information as part 
of a context-independent analysis of mapped read depth. 
For each base of all three chromosomes, the average read 
depth of that base and 200 bases on either side was 
compared between parent and mutant strains using a 
cutoff of >1.8 fold difference in read depth between 
strains. This approach identified four regions of ~2-fold 
increase in mapped reads in the mutant relative to the 
parent, two intergenic, one corresponding to a 43-bp 
region of the USP coding sequence and a large section 
(5.8 kb) corresponding to the suppressor tRNA^^UCA- 
C47:6U allele plus associated flanking DNA used for its 
prior integration in ySHlS, a precursor of yMWB3-15 
(Table 2). While both strategies identified few differences 
in copy number at loci other than the suppressor 
tRNA^"UCA-C47:6U gene, the others were not as well 



supported statistically as the tRNA^"UCA-C47:6U gene 
duplication (see 'Discussion' section), nor could they as 
readily explain the suppression phenotype. Thus, the 
tRNA^'=''UCA-C47:6U gene duplication was an excellent 
candidate as causative of the suppression phenotype. 

Deliberate introduction of an additional 
tRNA^"UCA-C47:6U gene into yMWB3-15 
produces suppression phenotype 

Since two analytical approaches suggested duplication of 
the suppressor tRNA^"UCA-C47:6U gene, we tested if 
introduction of an additional copy of this gene would 
result in the suppression phenotype. The parent strain 
was transformed by chromosomal integration of hnear- 
ized kan^ -contaimng pFA6A vector harboring either a 
suppressor tRNA^"UCA-C47:6U insert of -300 bp, or 
no insert. Transformed cells were plated on YES, then 
replica plated onto YES + G418. G418-resistant colonies 
were randomly picked from both the no-insert and the 
suppressor tRNA^"UCA-C47:6U-insert transformants, 
streak purified and transferred to limiting adenine for 
comparison to each other, parent and mutant, for TMS 
phenotype. None of the three analyzed transformants 
derived from the empty vector showed suppression 
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Figure 3. Deliberate introduction of an additional chromosomal copy of the suppressor tRNA^°'UCA-C47:6U gene into yMWB3-15 is sufficient to 
produce the suppressed phenotype with increased tRNA^'"^UCA-C47:6U levels. (A) Suppression phenotypes of parent and mutant strains as 
compared to parent transformed with either the integrated vector without a suppressor tRNA^'"^UCA-C47:6U gene (Parent + empty vector, 3 out 
of 3 isolates shown), or the integrated vector with the suppressor tRNA^"UCA-C47:6U gene insert (Parent + suppressor-tRNA, 5 out of 5 isolates 
shown) (see text). (B) Northern blot of RNA purified from various strains and probed for mature suppressor tRNA^'''^UCA-C47:6U (upper panel). 
Parent, mutant and one each of the "parent + empty vector' and 'parent + suppressor-tRNA' strains from (A) are shown: yAS99 is a negative control 
strain that does not carry a suppressor tRNA^"='UCA gene. Lower panel shows the same blot probed for U5 snRNA used as a loading control for 
normalizing quantitation; the ratio of tRNA/U5 snRNA is indicated below the lanes. (C) Semiquantitative PCR of the suppressor 
tRNA^"UCA-C47:6U gene and the slal^ locus used here as a single copy gene control. After ensuring that amplification was in the quantitative 
linear range of the assay (data not shown), duplex PCR was performed under linear quantitative conditions. Band densities were quantitated and 
normalized to parent, and indicated under the lanes; numbers in parentheses reflect the range obtained from quadruplicate experiments. 



(Figure 3 A, row 3). By contrast, all five analyzed 
transformants derived from the suppressor 
tRNA^"UCA-C47:6U gene clearly demonstrated suppres- 
sion greater than in the parent strain yMWB3-15 
(Figure 3A, compare rows 1 and 4) and to a similar 
degree as in the spontaneous mutant yDA317 
(Figure 3 A, row 2). This demonstrated that introduction 
of a suppressor tRNA^'"'UCA-C47:6U gene into our 
parent strain results in a phenocopy of the mutant pheno- 
type. Moreover, although yDA317 contained an extra 
tRNA^"UCA-C47:6U gene as well as associated pJK148 
vector DNA, the present result isolated the phenotypic 
segment to the extra tRNA^'"'UCA-C47:6U gene. It also 
showed that the synonymous erg3-SNP in yDA317 was 
not required for the suppression phenotype. The data 
argue that the suppressor tRNA^'"'UCA-C47:6U gene du- 
plication detected by whole genome sequencing is respon- 
sible for the suppression phenotype in the yDA317 
mutant. 

Elevated tRNA^"UCA-C47:6U levels in TMS mutant 
yDA317 

As noted in the 'Introduction' section, preliminary char- 
acterization indicated that yDA317 exhibited higher levels 
of tRNA^"UCA-C47:6U than the parent, yMWB3- 15. To 
evaluate if introduction of an extra tRNA UCA-C47:6U 
gene also led to elevated tRNA^'"'UCA-C47:6U levels, we 
performed northern blots. Figure 3B demonstrates 
increased suppressor tRNA^'""UCA-C47:6U levels in 
both yDA317 mutant and yMWB3-15 transformed with 



the suppressor tRNA^'"'UCA-C47:6U gene, relative to the 
nontransformed parent strain. Using U5 snRNA for cali- 
bration, suppressor tRNA^'"'UCA-C47:6U levels were re- 
producibly elevated roughly 8-fold in the yDA3 1 7 mutant 
relative to the yMWB3-15 parent (Figure 3B, compare 
lanes 2 and 3). Similar increase occurred in yMWB3-15 
transformed with the suppressor tRNA^"'UCA-C47:6U 
gene (lane 5) but not with the vector only (lane 4). 

Verification of the additional copy of the suppressor 
tRNA^"UCA-C47:6 gene 

This verification was achieved by semi-quantitative duplex 
PCR amplification of genomic DNA at the suppressor 
tRNA^"UCA-C47:6U and the slctl loci, the latter as a 
single copy gene control in the same reaction 
(Figure 3C). PCR products from four independent reac- 
tions were quantitated for the tRNA^""UCA-C47:6U 
bands and normalized to the slal PCR bands in the 
same lanes with the results shown below the lanes of 
Figure 3C. The tRNA^""UCA-C47:6U fragment was 
2-fold increased in yDA317 relative to the yMWB3-15 
parent strain (lanes 2 and 3), consistent with estimation 
derived from the Solexa sequence read coverage. A similar 
increase was observed for the tRNA-transformed parent 
(Figure 3C, lane 5) relative to the empty vector- 
transformed parent strain (lane 4). The cumulative data 
estabhsh that a suppressor tRNA^"UCA-C47:6U gene du- 
plication causes elevated tRNA^"UCA-C47:6U levels and 
increased tRNA^"UCA-C47:6U-mediated suppression. 
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Figure 4. Partial duplicated state of the pJK148»tRNA^"UCA-C47:6U»/ei;/ locus in the yDA317 mutant. (A) Overview of the 
pJK148»tRNA^"UCA-C47:6U»/fi/; locus in yMWB3-15 (upper) and yDA317 (lower), illustrating the relative positions of PCR primers (horizontal 
arrows labeled 1-4) used in (B). DNA lengths between the primer pairs are indicated in bp. Lower case t indicates position of the 
tRNA*^"UCA-C47:6U gene. The horizontal line connecting the two gray-filled triangles reflects the region duplicated in yDA317. A key depicting 
different components of the loci is shown in lower right. (B) EtBr stained gel of PCR products obtained with the different primers indicated under 
the lanes, using genomic DNA from yMWB3-15 (lanes 1-7) and yDA317 (lanes 8-14) as template. Primers 2 and 3 would produce a product only 
from the partially duplicated locus. Size markers are indicated on the left and right in kb. 



Tandem duplication of the tRNA^"-C47:6U gene and 
surrounding DNA 

The Solexa data indicated duplication of the suppressor 
tRNA gene and flanking DNA. We suspected a tandem 
duplication and tested this using a four-primer PCR 
approach (61). Genomic DNA was amplified using differ- 
ent combinations of four primers as diagramed in 
Figure 4A. The expected PCR fragments from primer 
pairs 1, 2 and 3, 4 should occur regardless of copy 
number and can be considered as positive controls for 
the primers (Figure 4B, lanes 1, 5, 8 and 12). Primer 2 
alone or primer 3 alone would produce products only if 
tandem duplication occurred in opposite orientation, 
which was not observed (Figure 4B, lanes 6 and 7, and 
13 and 14). However, combining primers 2 and 3 
produced a ~5.2-kb fragment from yDA317 but not 
yMWB3-15 (Figure 4B, compare lanes 3 and 10). This 
fragment would arise only if tandem duplication 
occurred in the same orientation. As confirmation, this 
5157-bp fragment was subjected to standard sequencing 
and found to have the anticipated sequence including an 
intact leul* (data not shown). (Although yDA317 
contains two leul* genes these cells do not exhibit 
growth advantage over yMWB3-15, which contains a 
single leul^ gene, in media lacking leucine, data not 
shown.) From these data, it appeared that the 
yMWB3-15 region demarcated by the gray filled triangles 
(Figure 4A) containing the suppressor tRNA gene was 
duplicated in yDA317. 



tRNA gene duplication as a common source of our 
spontaneous TMS mutants 

Using the PCR tRNA gene duplication assay we examined 
genomic DNA from 30 additional mutants isolated at the 
same time as yDA317 and stored at — 80°C. Upon 
reevaluation, three of the mutants had weak or no sup- 
pression phenotype and exhibited a single copy 
tRNA^"UCA-C47:6U gene. Of the remaining 27 sup- 
pressed mutants, 21 had two copies of the 
tRNA^'"'UCA-C47:6U gene and four had three or more 
copies. Thus, 25 of 28 spontaneous mutants (90%) with 
increased TMS had undergone tRNA^'=''UCA-C47:6U 
gene duplication or additional amplification. The three 
suppressed mutants with a single tRNA^"UCA-C47:6U 
gene did not express elevated levels of tRNA 
"UCA-C47:6U (data not shown) and are thus of ques- 
tionable interest. 

The slal-rrm allele is required for TMS in the tRNA 
gene duplication mutant 

The northern blot data were somewhat surprising because 
prior experience with TMS suggested that an increase in 
suppressor tRNA^^UCA levels necessary to cause a TMS 
phenotype would be greater than a 2-fold increase 
expected from a gene duplication. 

We considered two models by which a tRNA 
^"UCA-C47:6U gene duplication can lead to a dispropor- 
tionately large increase in tRNA^'=''-C47:6U levels, which 
could potentially be distinguished experimentally. It was 
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Table 3. Strains used in this study 



Strain 


Genotype 


Phenotype 


Details 


ySH18 


h- 


ade6-704. slal-A, ura4-D, leul-32::tRNA U47:6-leul + 


Red 


Progenitor 


yMWB2-4 


h- 


ade6-704, slal+::slalY157A-kanMX6, ura4-D. leul-32::tRNA U47:6-leul+, 


Pink 


Test strain 


yMWB3-15 


h- 


ade6-704, slal+::slalY157A/F201A-kanMX6, ura4-D, leul-32::tRNA U47:6-leul+ 


Red 


Parent strain 


yMWBl-1 


h- 


ade6-704. slal-A::kanMX6, ura4-D, leul-32::tRNA U47:6-leul+ 


Red 


Isogenic slal-A test strain 


yDA317 


h- 


ade6-704, slal+::slalY157A/F201A-kanMX6, ura4-D, leul-32::tRNA U47:6-leul+ 


White 


Mutant strain 


yFSlOl 


h- 




White 


Ref strain 



assumed for both models that the increase in 
pre-tRNA^"UCA-C47:6U transcription that results 
from gene duplication would overcome a concentration- 
dependent process that would allow some of the 
pre-tRNA^"UCA-C47:6U to evade decay for at least 
long enough to be engaged by a more productive matur- 
ation process that promotes accumulation. In the first 
model, a 2-fold increase in the concentration of nascent 
pre-tRNA substrate might enhance the rate of association 
with the Slal-Y157A,F201A protein, whereas in the 
second model, the increase in pre-tRNA might exceed 
the capacity of a degradative enzyme. The former was 
more appealing for multiple reasons including because as 
noted in the Introduction' section, Slalp-rrm is expected 
to exhibit only a subtle deficiency in RNA binding relative 
to wild type Slalp (32), which might be over ridden by a 
2-fold increase in the concentration of nascent 
pre-tRNA^"UCA-C47:6U hgand. In the second model, 
the increased pre- tRNA^"UCA-C47:6U substrate might 
exceed the capacity of a degradative nuclease in the 
absence of Slalp-rrm. It might be expected that in the 
absence of Slalp more pre-tRNA^"UCA-C47:6U would 
be targeted for decay than in its presence, and the 
increased pre-tRNA produced from a second copy of 
the tRNA^"UCA-C47:6U gene would further exceed the 
capacity of the decay pathway and lead to an increased 
TMS phenotype. 

We did an experiment to ask if slal-rrm was required 
for the TMS phenotype upon increase in tRNA 
^'''^UCA-C47:6U gene copy number. We examined 
yMWBl-1 {slal-A) which is isogenic with yMWB3-15 
except that it is devoid of Slal-rrm protein (Table 3 and 
data not shown). Neither yMWBl-1 nor yMWB3-15 
exhibits TMS (Figure 5A, row 1). yMWBl-1 was trans- 
formed with empty vector or vector containing the 
tRNA^'=''UCA-C47:6U gene, and then examined for 
TMS, with parallel transformation of yMWB3-15 to 
serve as control. Individual transformants were picked 
randomly and then examined for TMS (Figure 5A, 
rows 2-5). Introduction of vector alone or vector plus 
additional copy of suppressor tRNA gene into 
yMWBl-1 did not produce suppression (Figure 5A rows 
4 and 5). This was in contrast to yMWB3-15 
{slal-Y157A,F201A) into which an extra copy of the sup- 
pressor tRNA gene produced increased TMS in all 
five transformants, whereas empty vector did not 
(Figure 5A, rows 2 and 3, respectively). These data 
would appear to distinguish between the two models con- 
sidered above and suggest that the mechanism by which 
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yMWB3-15 + suppressor-tRNA 
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Figure 5. The .?/<7y-Y157A,F201A allele is required for TMS due to 
tRNA gene duplication. (A) TMS phenotypes of parent (yMWB3-15), 
mutant (yDA317) and isogenic test (yMWBl-1) strains (row 1) as 
compared to yMWB3-15 transformed with empty vector (row 2), or 
with the suppressor tRNA^"^UCA-C47:6U gene insert (row 3) and 
yMWBl-1 transformed with empty vector (row 4), or with suppressor 
tRNA^'-''UCA-C47:6U gene insert (row 5). (B) yMWB3-15 transformed 
with empty expression vector pRep4x (row 1), pRep4x-s/i7/^ (row 3) 
and pKfgA\-slal-Y157A,F201A (row 3). Five transformants of each are 
shown. 

tRNA^'='UCA-C47:6U gene duplication leads to the 
observed TMS phenotype requires Slal-rrm protein (see 
'Discussion' section). 

The data are consistent with the expectation that an 
increase in nascent pre-tRNA^*" UCA-C47:6U produced 
from the second copy gene would be enough to increase 
its rate of association with the Slalp-rrm, with resulting 
positive effects on TMS. If so, we might expect that 
overexpression of ectopic Slalp-rrm in yMWB3-15 
would lead to TMS. Indeed, ectopic expression of either 
wild-type slat or slal-Y157A,F201A, led to TMS 
(Figure 5B). The data in Figure 5 support the contention 
that increasing the association of pre-tRNA 
^'=''UCA-C47:6U with Slalp-Y157A,F201A, is sufficient 
to lead to TMS. 

Accumulation of 24 nuclear DNA mutations in separated 
S. pomhe strains 

The original Sanger Institute sequence assembly of 
972h- (1) was recently compared to that obtained using 
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Table 4. Summary of polymorphisms found in our parent strain, yMWB3-15, by Solexa sequencing and the 5. pomhe reference genome, yFSlOl 
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Chr = chromosome, 1,2,3 or mitochondria (Mt). Cover = coverage. 

Resulting coding change, if any is noted in the 'Mutation' column, *'intergenic, "'synonymous mutation in coding region, ""'-intronic, '^-ade6-704 allele 
mutation confirming previous sequencing (65), "-intronic based on revised splice prediction (72), '^-atp6 frameshift mutations result in two amino acid 
mutations leaving 110 unchanged. 

Each nuclear SNP was verified by conventional PCR-mediated sequencing and the identities are noted in the rightmost columns. 



Solexa technology for a single 972h- isolate 
designated yFSlOl (51). We compared our sequence to 
the yFSlOl sequence obtained by Solexa technology 
and identified 24 differences (Table 4). At each of 
these positions, yDA317 and yMWB3-15 were identical. 
We prepared DNA from yFSlOl (kindly provided by 
N. Rhind) and verified these differences in yMWB3-15 
and yFSlOl in parallel by conventional sequencing. 
The 24 SNPs consisted of four coding mutations 
that changed identities of single amino acids in each of 
four protein sequences, one miscRNA of unknown 
function, four synonymous coding and 14 intergenic 
or intronic mutations (Table 4). In addition to these, the 
Solexa data confirmed mutation in ade6-704 allele in 
our strain, a T^A that changes a cysteine codon at 
position 215 of Ade6p to a stop codon (Table 4) as 



reported (65), as well as mutations in the ura4 locus 
(data not shown). 

Mitochondrial mutations 

After estabhshing our modified reference genome for 
yMWB3-15 and obtaining and verifying the nuclear 
DNA SNPs relative to yFSlOl, we further modified it 
by incorporating the available 5. pombe mt DNA refer- 
ence sequence (54,55). This had two remarkable effects on 
the analysis. Overall mapped read coverage increased 
from 52% to 96%, with 44% of the reads mapping 
to the mtDNA. Accordingly, read depth for the mt 
DNA was uniformly very high, with a range of 
5000-10000 coverage (e.g. see Table 4). Assuming -100 
copies of mtDNA per 5. pombe cell (67), this degree of 
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observed coverage is congruent with our nuclear DNA 
coverage. 

With a cutoff of 85% identity, we observed 13 
SNPs and two single-nucleotide insertion/deletions in 
the mtDNAs of the reference mtDNA sequence (54,55) 
as compared to our parent and mutant strains 
(Table 4, Mt). In contrast to the observed SNPs in 
the nuclear chromosomes, the majority of these mt 
mutations result in coding mutations. The source(s) of 
these mtDNA differences is unknown (see 'Discussion' 
section). 

We then examined the available yFSlOl data set which 
had not previously been analyzed for mtDNA (51), 
comparing it to the reference mtDNA (54,55). The same 
set of 13 SNPs and two insertion/deletions was identified 
in yFSlOl as in our strain, reveahng 100% identity of 
yMWB3-15 and yFSlOl throughout their mtDNA 
(Table 4, Mt). In conclusion, our strain and yFSlOl 
differ in 24 SNPs in their nuclear DNA but are identical 
in their mtDNA. 



DISCUSSION 

One conclusion that can be drawn from this work is 
that comparative whole genome sequencing of S. pombe 
can be used to detect and map mutations in spontaneous 
mutants selected for a phenotype, in this case related 
to tRNA expression, after one plating. The analytical 
tools described here can detect insertions, deletions, 
differences in gene copy number and SNPs. This led 
to the discovery that duplication of a suppressor 
tRNA^'='UCA-C47:6U gene is responsible for the 
selected phenotype. 

To compare our strains with each other we first 
generated an electronic version of our parent strain 
genome using the original S. pombe reference as a 
platform (1). Having this in place, we could then 
compare our strain with the reference 972h- yFSlOl 
genome that had been recently updated using Illumina 
(Solexa) technology (1,51). This revealed that our strain 
differs from yFSlOl by only 24 SNPs (in addition to 
expected difference in marker alleles such as ade6-704) 
(Table 4) (65). 

A second conclusion is that the RNA chaperone-like 
activity of La protein appears to be able to activate its 
target pre-tRNA ligands for subsequent accumulation of 
the mature functional tRNA. As will be discussed in more 
detail below, consistent with other RNA chaperones that 
function via low-affinity and nonspecific interactions with 
their RNA ligands, our data suggest that the deficiency in 
nonspecific RNA binding activity of the mutated Slal-rrm 
protein can apparently be overcome by the increase in 
nascent pre-tRNA that would result from a 2-fold 
increase in gene dosage. Although a 2-fold increase in a 
nascent pre-tRNA may appear modest, the effects should 
be considered in the context of the increased concentration 
of a single pre-tRNA in a diverse mixture of other pre- 
tRNAs that compete with variable affinity for La. 
Accordingly, as a result of a 2-fold increase in suppressor 
pre-tRNA it should more competitively associate with the 



Slal-rrm protein and thereby benefit from its chaperone- 
hke activity. 

tRNA gene duplication 

Our genetic screen was designed to select for the appear- 
ance of mutants that exhibited a gain-of-function in 
codon-specific TMS of ade6-704. Indeed the output of 
the screen reported here was found to be an increase in 
the codon-specific TMS which occurred in association 
with increased suppressor tRNA^'"'UCA-C47:6U levels. 
Analysis led to the discovery that duphcation of the 
suppressor tRNA^'=''UCA-C47:6U gene is responsible for 
the selected phenotype. Although at first glance this 
might appear to have been likely, it was unexpected for 
the following reasons. First, on the basis of other genetic 
screens of yeast, SNPs appear to be more often uncovered 
than gene duplication. Second, prior and current evidence 
indicate that the deficiency in tRNA^'"'UCA-C47:6U ex- 
pression in our parent strain is at the posttranscriptional 
level, due to degradation of the precursor-tRNA in the 
absence of protection by La activity-2 (32). Thus, our ex- 
pectation was that we might find a debilitating mutation in 
a gene that contributes to a surveillance-related degrad- 
ation of the pre-tRNA^'='UCA-C47:6U in the absence of 
La activity-2. Finally, prior experience suggested that the 
mcrease of tRNA^^UCA levels required to cause 
increased TMS phenotype was anticipated to be greater 
than the 2-fold increase expected from a gene duplication. 
On this issue, we were surprised by the disproportionately 
high (8-fold) level of increase in mature suppressor 
tRNA^'='UCA-C47:6U in yDA317 and upon intentional 
introduction of a second copy of the tRNA 
^'=''UCA-C47:6U gene. 

Evidence that nonspecific, low-affinity interactions 
contribute to the RNA chaperone-like activity 
of La protein 

We hoped to identify a trans-d^ciing factor whose mutation 
would allow the structurally defective pre-tRNA^'^'^- 
C47:6U to overcome decay as in wild type slal"^ cells. 
Surprisingly, tRNA^'"'UCA-C47:6U gene duplication led 
to disproportionately high levels of tRNA^""UCA-C47:6U 
accumulation. An initial idea that the tandem 
arrangement of tRNA^'='UCA-C47:6U genes in yDA317 
contributes to activated transcription due to increased 
local concentration of pol III was dismissed after 
finding that non-targeted introduction of a second 
tRNA^"UCA-C47:6U gene also led to increased suppres- 
sion in all of the multiple transformants examined, with a 
5-8-fold increase in tRNA^"UCA-C47:6U levels 
(Figure 3, data not shown). Toward gaining insight into 
the mechanism by which the gene duplication led to un- 
expectedly high levels of tRNA^"UCA-C47:6U we 
determined that this was dependent on the presence of 
slal-rrm allele which carries the Y157A,F201A substitu- 
tions (or wild type slal^) since introduction of an extra 
copy tRNA^"-C47:6U gene into the slal-null strain 
yMWBl-1 did not cause suppression (Figure 5A). 
Likewise overexpression of slal-Y157A,F201A (or wild 
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type slal ) also led to increased TMS in yMWB3-15 (and 
yMWBl-1, not shown) (Figure 5B). 

The question is how a 2-fold increase in synthesis of 
a structurally deficient pre-tRNA, as would be expected 
to occur upon gene duplication, can lead to a greater-fold 
increase in tRNA accumulation in a La-dependent 
manner. We believe that a 2-fold increase in nascent 
pre-tRNA is sufficient to overcome the shght decrease 
in the RNA-binding affinity of Slalp caused by 
the Y157A,F201A mutations to the P-sheet surface 
of RRMl. By overcoming this binding deficiency, 
the pre-tRNA^"UCA-C47:6U would benefit from the 
RNA chaperone-like activity of Slalp and evade nu- 
clear decay during a critically vulnerable period in its 
maturation. Avoiding decay during a vulnerable period, 
even transiently, could allow a fraction of the 
tRNA^'"'UCA-C47:6U to accumulate, e.g. by export to 
the cytoplasm or other locale away from the nuclear 
decay system. 

To better understand the Slalp-rrm related aspects of 
this model, some of the RNA-binding characteristics of La 
protein and other RRM proteins as well as RNA chaper- 
one activities should be considered. Different RRM- 
containing proteins use their P-sheets and connecting 
loops in a variety of ways to engage their target RNAs 

(68) . In some RRM proteins, the aromatic residues 
equivalent to Slalp Y157 and F201 interact with RNA 
via stacking in a sequence-nonspecific manner and con- 
tribute significantly to overall affinity, whereas in others 
their contribution is much less (68). For La protein, the 
LaM confers high-affinity UUU-3'OH recognition, while 
the RRMl P-sheet surface appears to confer nonspecific 
binding to other regions of the pre-tRNA as might be 
expected for interaction with a variety of RNAs that 
differ in sequence (47). Whereas mutation of the conserved 
aromatic residues on the RRMl P-sheet of some other 
RRM proteins decrease RNA binding, for La RRMl no 
significant reduction in pre-RNA binding was observed 
in vitro, although the negative effects on TMS were 
apparent in vivo (32). Thus, mutation of the RRMl 
P-sheet aromatic residues appears to minimally decrease 
the nonspecific RNA-binding activity of La protein 
(32,47). Accordingly, a 2-fold increase in the concentra- 
tion of pre-tRNA ligand may be sufficient to increase the 
rate of association with Slalp-Y157,F201 and overcome 
this minimal deficiency. Consistent with this model is that 
nonspecific, low-affinity interactions indeed characterize 
other RNA chaperones that assist RNAs during folding 

(69) . That slalp-rrm confers this activity advances our 
understanding of the RNA chaperone-like activity of La 
protein. 

Finally, as noted in the third paragraph of the 
'Discussion' above, we should also consider the effect of 
a 2-fold increase in the concentration of a single pre-tRNA 
on the potential competitive strength for La binding in 
the presence of all of the other nascent pre-tRNAs. 
Accordingly, a 2-fold increase in nascent suppressor 
pre-tRNA should confer better competition for the 
Slal-rrm protein and thereby afford it the benefit of its 
chaperone-like activity. 



Genetic 'mutation' in the 5'. pomhe strains examined here 
is relatively low 

The number of discrepancies detected in our parent and 
mutant strain was low, consisting of a synonymous SNP 
and a duplication of > 500 bp, the latter of which was 
causative of the selective phenotype. We note that in 
addition to the suppressor tRNA gene, three other duph- 
cations were detected (Table 2), although the relatively 
small size of these regions at 23, 43 and 108 bp, and the 
intergenic locale of two of them, made them unlikely can- 
didates as causative of the phenotype. Moreover, for these 
loci the ratio of mutant-to-parent reads was <2.0, whereas 
for the suppressor tRNA gene the ratio was >2.0. 
Perhaps, more significant, the read depth or coverage at 
each of these three loci was lower than for the suppressor 
tRNA gene (Table 2), and likely result from relative 
mapped read depth noise exceeding the averaging 
window. On the basis of the last two points, we suspect 
that these may not reflect true duplications in the yDA317 
strain, although we have not attempted to verify this. 

The original sequence assembly of 972h- (1) was 
recently compared to that obtained using Solexa technol- 
ogy for a single 972 isolate designated yFSlOl (51). In that 
case, 200-fold coverage provided a very low false positive 
rate. Significantly, 190 discrepancies were observed and 
noted to be possibly due to errors in the reference 972 
assembly, sequence polymorphisms in isolates of 972 or 
systematic errors (51). Our sequence matched the yFSlOl 
sequence in all of these previously noted discrepancies, but 
did not match either the original reference nor the yFSlOl 
at 24 other positions (Table 2). At each of these 24 pos- 
itions, yDA317 and yMWB3-15 were identical. These dif- 
ferences in yMWB3-15 and yFSlOl were verified by PCR 
of the loci from both strains (yFSlOl provided by N. 
Rhind) and conventional sequencing (Table 4, last two 
columns). 

The 24 SNPs observed in our strain and yFSlOI con- 
sisted of five coding mutations that changed the identity of 
an amino acid in a protein sequence, one miscRNA of 
unknown function, four synonymous coding and 14 
intergenic or intronic SNPs (Table 4). Functional conse- 
quences of these SNPs are unknown. These could, in some 
cases, reflect mutations that confer a phenotype or be 
completely neutral, we do not know. 

Of the four coding changes, we note the following. For 
Rpb2p, L872R changes a conserved Leu in the several 
yeast sequences examined and therefore appears to be a 
mutation in our strain or its predecessor. For Puf3p, 
I709M occurs close to the end of the protein; the sur- 
rounding sequence is highly conserved among 
Schizosaccharomyces species (although not S. cerevisiae 
or human) and, therefore, appears to be a mutation in 
our strain or its predecessor. 

Twenty-four SNPs observed in two strains that were 
separated about 30 years ago suggest genetic drift or 
possibly inadvertent selection if a SNP conferred a pheno- 
type. In any case, we would caution against the use of this 
data for the purpose of calculating a 'mutation rate' 
because the length of time these strains have been continu- 
ously growing (versus frozen storage) is unknown as are 
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other aspects of their propagation. Nonetheless, the per- 
spective that arises is that the overall mutation burden 
in laboratory strains of S. pombe appears not to be very 
high. Indeed, it is low enough to make whole genome 
sequencing of spontaneous mutants a plausible endeavor. 

We note that this study is to be distinguished from a 
previous report of whole genome sequencing for mutant 
mapping in S. pombe (70). In that case, sequencing was 
done after cells were mutagenized with 7V-methyl-A^'-nitro- 
A^-nitrosoguanidine (MNG) and the mutant was isolated. 
Comparative analyses led to a relatively large number of 
SNPs, 73 of which were present in the selected mutant that 
had been mutagenized (70). The presence of so many 
associated mutations required a significant amount of 
genetic techniques before the selected phenotype could 
be attributed to the causative mutation (70). By 
contrast, our analyses of whole genome sequencing after 
spontaneous mutation led to one synonymous SNP plus a 
gene duplication that was shown to be causative of the 
selected phenotype, with relatively far less requirement 
for genetic techniques. 

That we isolated a tRNA gene duplication rather than 
SNPs as a common mutation (90%) in our TMS mutants 
may reflect an idiosyncrasy of our genetic screen, or 
perhaps a characteristic of tRNA genes. As noted above, 
SNPs would appear to be more often uncovered from 
genetic screens of yeast than gene duplication. However, 
since mutants obtained from genetic screens involve char- 
acterization prior to detailed analysis, it seems possible 
that typical prerequisite examination might disfavor char- 
acterization of gene duplication mutants. The abihty to 
map phenotype causative mutations by whole genome 
sequencing without the need for extensive prerequisite 
characterization may allow a wider variety of mutations 
to be isolated. 

Software availability and accessibility 

The toolset of programs compiled for this purpose is 
designated MATCH-G (Mutational Analysis Toolset 
Comparing wHole Genomes) and is available with de- 
scription and instructions for free download at http:// 
science.nichd.nih.gov/confluence/display/smcb/MATCH- 
G+Program. Some of the programs preexisted and some 
were developed specifically for this purpose. Use requires 
minimal basic computer knowledge (folder structure, file 
formats) and is written to be able to scale to any size 
genome for the purpose of finding differences between 
two sequence data sets where a pre-existing related refer- 
ence genome exists. It is run either from the command 
prompt or with a simple graphic-user interface. No 
coding or knowledge of programming languages is 
required for use. Use in MacOSX or UNIX-hke environ- 
ments is most straightforward where terminal windows 
and Perl are present by default. Use in windows operating 
systems may be more complicated. 

CONCLUDING REMARKS 

Our work demonstrates that whole-genome sequencing of 
S. pombe has the ability to detect single-nucleotide 



mutations, base insertions and deletions, and changes in 
gene copy number. The tools developed here should be 
applicable toward identifying factors and pathway com- 
ponents in high-throughput screens that produce spontan- 
eously arising mutants. This work expands the tools 
available for the 5*. pombe community. The tools de- 
veloped here for comparative whole genome sequencing 
could readily be applied to S. cerevisiae and other model 
systems. 



SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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